Overview

Dataset statistics

Number of variables17
Number of observations5008
Missing cells1
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory665.3 KiB
Average record size in memory136.0 B

Variable types

DateTime1
Text14
Categorical2

Alerts

crew_aboard is highly overall correlated with crew_fatalitiesHigh correlation
crew_fatalities is highly overall correlated with crew_aboardHigh correlation

Reproduction

Analysis started2023-12-06 14:51:50.409184
Analysis finished2023-12-06 14:52:07.328447
Duration16.92 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

fecha
Date

Distinct4577
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
Minimum1908-09-17 00:00:00
Maximum2021-07-06 00:00:00
2023-12-06T11:52:08.116613image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-06T11:52:08.956097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct1217
Distinct (%)24.3%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:10.549119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length4
Mean length3.1565495
Min length1

Characters and Unicode

Total characters15808
Distinct characters16
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique585 ?
Unique (%)11.7%

Sample

1st row1718
2nd row?
3rd row0630
4th row?
5th row1830
ValueCountFrequency (%)
1504
29.8%
c 36
 
0.7%
1500 35
 
0.7%
1100 30
 
0.6%
1400 30
 
0.6%
1700 29
 
0.6%
1200 28
 
0.6%
1600 28
 
0.6%
0800 26
 
0.5%
1900 25
 
0.5%
Other values (1189) 3273
64.9%
2023-12-06T11:52:12.616899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3748
23.7%
1 2976
18.8%
2 1545
9.8%
? 1504
9.5%
5 1383
 
8.7%
3 1298
 
8.2%
4 1004
 
6.4%
9 557
 
3.5%
8 538
 
3.4%
7 524
 
3.3%
Other values (6) 731
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14006
88.6%
Other Punctuation 1723
 
10.9%
Lowercase Letter 38
 
0.2%
Space Separator 36
 
0.2%
Uppercase Letter 5
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3748
26.8%
1 2976
21.2%
2 1545
11.0%
5 1383
 
9.9%
3 1298
 
9.3%
4 1004
 
7.2%
9 557
 
4.0%
8 538
 
3.8%
7 524
 
3.7%
6 433
 
3.1%
Other Punctuation
ValueCountFrequency (%)
? 1504
87.3%
: 218
 
12.7%
; 1
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
c 38
100.0%
Space Separator
ValueCountFrequency (%)
36
100.0%
Uppercase Letter
ValueCountFrequency (%)
Z 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 15765
99.7%
Latin 43
 
0.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3748
23.8%
1 2976
18.9%
2 1545
9.8%
? 1504
9.5%
5 1383
 
8.8%
3 1298
 
8.2%
4 1004
 
6.4%
9 557
 
3.5%
8 538
 
3.4%
7 524
 
3.3%
Other values (4) 688
 
4.4%
Latin
ValueCountFrequency (%)
c 38
88.4%
Z 5
 
11.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 15808
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3748
23.7%
1 2976
18.8%
2 1545
9.8%
? 1504
9.5%
5 1383
 
8.7%
3 1298
 
8.2%
4 1004
 
6.4%
9 557
 
3.5%
8 538
 
3.4%
7 524
 
3.3%
Other values (6) 731
 
4.6%

Ruta
Text

Distinct4125
Distinct (%)82.4%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:13.576187image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length72
Median length49
Mean length20.792931
Min length1

Characters and Unicode

Total characters104131
Distinct characters91
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3687 ?
Unique (%)73.6%

Sample

1st rowFort Myer, Virginia
2nd rowJuvisy-sur-Orge, France
3rd rowAtlantic City, New Jersey
4th rowVictoria, British Columbia, Canada
5th rowOver the North Sea
ValueCountFrequency (%)
near 1350
 
9.2%
off 350
 
2.4%
russia 255
 
1.7%
new 229
 
1.6%
brazil 176
 
1.2%
colombia 153
 
1.0%
canada 131
 
0.9%
france 127
 
0.9%
california 117
 
0.8%
mexico 113
 
0.8%
Other values (4153) 11657
79.5%
2023-12-06T11:52:15.529979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 13037
 
12.5%
9703
 
9.3%
e 7073
 
6.8%
i 6567
 
6.3%
n 6545
 
6.3%
r 6035
 
5.8%
o 5367
 
5.2%
, 5210
 
5.0%
l 4000
 
3.8%
s 3530
 
3.4%
Other values (81) 37064
35.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 74113
71.2%
Uppercase Letter 14738
 
14.2%
Space Separator 9704
 
9.3%
Other Punctuation 5362
 
5.1%
Dash Punctuation 105
 
0.1%
Decimal Number 66
 
0.1%
Control 21
 
< 0.1%
Close Punctuation 11
 
< 0.1%
Open Punctuation 11
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 13037
17.6%
e 7073
9.5%
i 6567
8.9%
n 6545
8.8%
r 6035
 
8.1%
o 5367
 
7.2%
l 4000
 
5.4%
s 3530
 
4.8%
t 3112
 
4.2%
u 2756
 
3.7%
Other values (31) 16091
21.7%
Uppercase Letter
ValueCountFrequency (%)
N 2032
13.8%
C 1456
 
9.9%
S 1145
 
7.8%
M 999
 
6.8%
B 952
 
6.5%
A 920
 
6.2%
P 787
 
5.3%
I 720
 
4.9%
R 652
 
4.4%
O 588
 
4.0%
Other values (17) 4487
30.4%
Decimal Number
ValueCountFrequency (%)
0 24
36.4%
1 15
22.7%
2 9
 
13.6%
5 8
 
12.1%
8 3
 
4.5%
3 2
 
3.0%
7 2
 
3.0%
9 2
 
3.0%
6 1
 
1.5%
Other Punctuation
ValueCountFrequency (%)
, 5210
97.2%
. 115
 
2.1%
' 24
 
0.4%
/ 6
 
0.1%
? 5
 
0.1%
& 1
 
< 0.1%
: 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
9703
> 99.9%
  1
 
< 0.1%
Control
ValueCountFrequency (%)
16
76.2%
5
 
23.8%
Dash Punctuation
ValueCountFrequency (%)
- 105
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 88851
85.3%
Common 15280
 
14.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 13037
14.7%
e 7073
 
8.0%
i 6567
 
7.4%
n 6545
 
7.4%
r 6035
 
6.8%
o 5367
 
6.0%
l 4000
 
4.5%
s 3530
 
4.0%
t 3112
 
3.5%
u 2756
 
3.1%
Other values (58) 30829
34.7%
Common
ValueCountFrequency (%)
9703
63.5%
, 5210
34.1%
. 115
 
0.8%
- 105
 
0.7%
0 24
 
0.2%
' 24
 
0.2%
16
 
0.1%
1 15
 
0.1%
) 11
 
0.1%
( 11
 
0.1%
Other values (13) 46
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104089
> 99.9%
None 42
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 13037
 
12.5%
9703
 
9.3%
e 7073
 
6.8%
i 6567
 
6.3%
n 6545
 
6.3%
r 6035
 
5.8%
o 5367
 
5.2%
, 5210
 
5.0%
l 4000
 
3.8%
s 3530
 
3.4%
Other values (64) 37022
35.6%
None
ValueCountFrequency (%)
é 14
33.3%
ö 5
 
11.9%
í 4
 
9.5%
ó 4
 
9.5%
ï 2
 
4.8%
á 2
 
4.8%
à 1
 
2.4%
ô 1
 
2.4%
è 1
 
2.4%
ä 1
 
2.4%
Other values (7) 7
16.7%
Distinct2268
Distinct (%)45.3%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:16.630298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length65
Median length47
Mean length18.921725
Min length1

Characters and Unicode

Total characters94760
Distinct characters87
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1734 ?
Unique (%)34.6%

Sample

1st rowMilitary - U.S. Army
2nd row?
3rd rowMilitary - U.S. Navy
4th rowPrivate
5th rowMilitary - German Navy
ValueCountFrequency (%)
air 1481
 
10.3%
971
 
6.7%
airlines 840
 
5.8%
military 778
 
5.4%
force 557
 
3.9%
airways 453
 
3.1%
u.s 302
 
2.1%
aeroflot 265
 
1.8%
lines 184
 
1.3%
royal 152
 
1.1%
Other values (2079) 8422
58.5%
2023-12-06T11:52:18.635059image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 10212
 
10.8%
9421
 
9.9%
r 8849
 
9.3%
a 7786
 
8.2%
e 6780
 
7.2%
n 5528
 
5.8%
A 5083
 
5.4%
o 4380
 
4.6%
l 4079
 
4.3%
s 4000
 
4.2%
Other values (77) 28642
30.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 68181
72.0%
Uppercase Letter 15071
 
15.9%
Space Separator 9422
 
9.9%
Dash Punctuation 939
 
1.0%
Other Punctuation 879
 
0.9%
Open Punctuation 115
 
0.1%
Close Punctuation 115
 
0.1%
Decimal Number 30
 
< 0.1%
Control 8
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 10212
15.0%
r 8849
13.0%
a 7786
11.4%
e 6780
9.9%
n 5528
8.1%
o 4380
6.4%
l 4079
 
6.0%
s 4000
 
5.9%
t 3921
 
5.8%
c 1996
 
2.9%
Other values (28) 10650
15.6%
Uppercase Letter
ValueCountFrequency (%)
A 5083
33.7%
M 1217
 
8.1%
S 1138
 
7.6%
C 910
 
6.0%
F 901
 
6.0%
T 679
 
4.5%
L 661
 
4.4%
U 534
 
3.5%
P 513
 
3.4%
N 496
 
3.3%
Other values (16) 2939
19.5%
Decimal Number
ValueCountFrequency (%)
0 5
16.7%
7 4
13.3%
4 4
13.3%
1 3
10.0%
2 3
10.0%
5 3
10.0%
6 2
 
6.7%
8 2
 
6.7%
9 2
 
6.7%
3 2
 
6.7%
Other Punctuation
ValueCountFrequency (%)
. 718
81.7%
/ 109
 
12.4%
' 25
 
2.8%
? 11
 
1.3%
, 10
 
1.1%
& 6
 
0.7%
Space Separator
ValueCountFrequency (%)
9421
> 99.9%
  1
 
< 0.1%
Control
ValueCountFrequency (%)
6
75.0%
2
 
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 939
100.0%
Open Punctuation
ValueCountFrequency (%)
( 115
100.0%
Close Punctuation
ValueCountFrequency (%)
) 115
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 83252
87.9%
Common 11508
 
12.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 10212
12.3%
r 8849
 
10.6%
a 7786
 
9.4%
e 6780
 
8.1%
n 5528
 
6.6%
A 5083
 
6.1%
o 4380
 
5.3%
l 4079
 
4.9%
s 4000
 
4.8%
t 3921
 
4.7%
Other values (54) 22634
27.2%
Common
ValueCountFrequency (%)
9421
81.9%
- 939
 
8.2%
. 718
 
6.2%
( 115
 
1.0%
) 115
 
1.0%
/ 109
 
0.9%
' 25
 
0.2%
? 11
 
0.1%
, 10
 
0.1%
& 6
 
0.1%
Other values (13) 39
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94637
99.9%
None 123
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 10212
 
10.8%
9421
 
10.0%
r 8849
 
9.4%
a 7786
 
8.2%
e 6780
 
7.2%
n 5528
 
5.8%
A 5083
 
5.4%
o 4380
 
4.6%
l 4079
 
4.3%
s 4000
 
4.2%
Other values (64) 28519
30.1%
None
ValueCountFrequency (%)
é 102
82.9%
á 5
 
4.1%
à 2
 
1.6%
ï 2
 
1.6%
ó 2
 
1.6%
í 2
 
1.6%
ç 2
 
1.6%
ã 1
 
0.8%
ú 1
 
0.8%
ê 1
 
0.8%
Other values (3) 3
 
2.4%
Distinct893
Distinct (%)17.8%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:19.780819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length1
Mean length1.5928514
Min length1

Characters and Unicode

Total characters7977
Distinct characters47
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique656 ?
Unique (%)13.1%

Sample

1st row?
2nd row?
3rd row?
4th row?
5th row?
ValueCountFrequency (%)
3728
74.1%
1 11
 
0.2%
101 10
 
0.2%
6 8
 
0.2%
4 7
 
0.1%
901 7
 
0.1%
115 6
 
0.1%
301 6
 
0.1%
201 6
 
0.1%
703 6
 
0.1%
Other values (883) 1235
 
24.6%
2023-12-06T11:52:21.672325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
? 3683
46.2%
1 638
 
8.0%
0 497
 
6.2%
2 495
 
6.2%
3 417
 
5.2%
5 385
 
4.8%
4 347
 
4.4%
6 330
 
4.1%
7 316
 
4.0%
8 291
 
3.6%
Other values (37) 578
 
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3986
50.0%
Other Punctuation 3715
46.6%
Uppercase Letter 156
 
2.0%
Dash Punctuation 87
 
1.1%
Space Separator 22
 
0.3%
Lowercase Letter 11
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 21
13.5%
S 14
 
9.0%
H 13
 
8.3%
P 11
 
7.1%
F 10
 
6.4%
C 10
 
6.4%
U 8
 
5.1%
R 7
 
4.5%
I 7
 
4.5%
L 7
 
4.5%
Other values (15) 48
30.8%
Decimal Number
ValueCountFrequency (%)
1 638
16.0%
0 497
12.5%
2 495
12.4%
3 417
10.5%
5 385
9.7%
4 347
8.7%
6 330
8.3%
7 316
7.9%
8 291
7.3%
9 270
6.8%
Lowercase Letter
ValueCountFrequency (%)
n 2
18.2%
a 2
18.2%
r 2
18.2%
o 1
9.1%
y 1
9.1%
h 1
9.1%
t 1
9.1%
e 1
9.1%
Other Punctuation
ValueCountFrequency (%)
? 3683
99.1%
/ 32
 
0.9%
Dash Punctuation
ValueCountFrequency (%)
- 87
100.0%
Space Separator
ValueCountFrequency (%)
22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7810
97.9%
Latin 167
 
2.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 21
 
12.6%
S 14
 
8.4%
H 13
 
7.8%
P 11
 
6.6%
F 10
 
6.0%
C 10
 
6.0%
U 8
 
4.8%
R 7
 
4.2%
I 7
 
4.2%
L 7
 
4.2%
Other values (23) 59
35.3%
Common
ValueCountFrequency (%)
? 3683
47.2%
1 638
 
8.2%
0 497
 
6.4%
2 495
 
6.3%
3 417
 
5.3%
5 385
 
4.9%
4 347
 
4.4%
6 330
 
4.2%
7 316
 
4.0%
8 291
 
3.7%
Other values (4) 411
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7977
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
? 3683
46.2%
1 638
 
8.0%
0 497
 
6.2%
2 495
 
6.2%
3 417
 
5.2%
5 385
 
4.8%
4 347
 
4.4%
6 330
 
4.1%
7 316
 
4.0%
8 291
 
3.6%
Other values (37) 578
 
7.2%

route
Text

Distinct3838
Distinct (%)76.7%
Missing1
Missing (%)< 0.1%
Memory size39.3 KiB
2023-12-06T11:52:22.660714image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length59
Median length52
Mean length18.948472
Min length1

Characters and Unicode

Total characters94875
Distinct characters92
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3630 ?
Unique (%)72.5%

Sample

1st rowDemonstration
2nd rowAir show
3rd rowTest flight
4th row?
5th row?
ValueCountFrequency (%)
5395
30.7%
city 213
 
1.2%
new 149
 
0.8%
san 140
 
0.8%
york 117
 
0.7%
paris 116
 
0.7%
training 103
 
0.6%
de 101
 
0.6%
london 88
 
0.5%
moscow 84
 
0.5%
Other values (3627) 11082
63.0%
2023-12-06T11:52:24.676897image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12646
 
13.3%
a 9833
 
10.4%
n 5568
 
5.9%
o 5503
 
5.8%
i 5244
 
5.5%
e 5107
 
5.4%
- 4927
 
5.2%
r 4487
 
4.7%
l 3420
 
3.6%
s 3075
 
3.2%
Other values (82) 35065
37.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 62474
65.8%
Uppercase Letter 12941
 
13.6%
Space Separator 12647
 
13.3%
Dash Punctuation 4931
 
5.2%
Other Punctuation 1827
 
1.9%
Control 30
 
< 0.1%
Decimal Number 16
 
< 0.1%
Final Punctuation 4
 
< 0.1%
Open Punctuation 3
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 9833
15.7%
n 5568
 
8.9%
o 5503
 
8.8%
i 5244
 
8.4%
e 5107
 
8.2%
r 4487
 
7.2%
l 3420
 
5.5%
s 3075
 
4.9%
t 3006
 
4.8%
u 2566
 
4.1%
Other values (30) 14665
23.5%
Uppercase Letter
ValueCountFrequency (%)
C 1232
 
9.5%
B 1140
 
8.8%
S 1081
 
8.4%
A 1046
 
8.1%
M 1042
 
8.1%
P 823
 
6.4%
L 788
 
6.1%
T 710
 
5.5%
K 640
 
4.9%
N 630
 
4.9%
Other values (18) 3809
29.4%
Decimal Number
ValueCountFrequency (%)
9 3
18.8%
1 3
18.8%
4 3
18.8%
2 2
12.5%
7 2
12.5%
8 1
 
6.2%
6 1
 
6.2%
0 1
 
6.2%
Other Punctuation
ValueCountFrequency (%)
, 915
50.1%
? 768
42.0%
. 98
 
5.4%
/ 20
 
1.1%
' 20
 
1.1%
: 5
 
0.3%
\ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
12646
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 4927
99.9%
– 4
 
0.1%
Control
ValueCountFrequency (%)
29
96.7%
1
 
3.3%
Final Punctuation
ValueCountFrequency (%)
’ 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 75415
79.5%
Common 19460
 
20.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 9833
 
13.0%
n 5568
 
7.4%
o 5503
 
7.3%
i 5244
 
7.0%
e 5107
 
6.8%
r 4487
 
5.9%
l 3420
 
4.5%
s 3075
 
4.1%
t 3006
 
4.0%
u 2566
 
3.4%
Other values (58) 27606
36.6%
Common
ValueCountFrequency (%)
12646
65.0%
- 4927
 
25.3%
, 915
 
4.7%
? 768
 
3.9%
. 98
 
0.5%
29
 
0.1%
/ 20
 
0.1%
' 20
 
0.1%
: 5
 
< 0.1%
– 4
 
< 0.1%
Other values (14) 28
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 94746
99.9%
None 121
 
0.1%
Punctuation 8
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12646
 
13.3%
a 9833
 
10.4%
n 5568
 
5.9%
o 5503
 
5.8%
i 5244
 
5.5%
e 5107
 
5.4%
- 4927
 
5.2%
r 4487
 
4.7%
l 3420
 
3.6%
s 3075
 
3.2%
Other values (63) 34936
36.9%
None
ValueCountFrequency (%)
é 38
31.4%
í 21
17.4%
á 15
 
12.4%
ó 14
 
11.6%
ü 6
 
5.0%
ã 6
 
5.0%
ç 4
 
3.3%
è 4
 
3.3%
ÃŽ 3
 
2.5%
ö 2
 
1.7%
Other values (7) 8
 
6.6%
Punctuation
ValueCountFrequency (%)
– 4
50.0%
’ 4
50.0%
Distinct2469
Distinct (%)49.3%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:26.000082image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length42
Median length36
Mean length18.496006
Min length1

Characters and Unicode

Total characters92628
Distinct characters78
Distinct categories12 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1863 ?
Unique (%)37.2%

Sample

1st rowWright Flyer III
2nd rowWright Byplane
3rd rowDirigible
4th rowCurtiss seaplane
5th rowZeppelin L-1 (airship)
ValueCountFrequency (%)
douglas 1130
 
8.3%
boeing 418
 
3.1%
dc-3 387
 
2.8%
lockheed 332
 
2.4%
de 294
 
2.2%
havilland 292
 
2.1%
antonov 288
 
2.1%
canada 159
 
1.2%
otter 146
 
1.1%
ilyushin 142
 
1.0%
Other values (2525) 10038
73.7%
2023-12-06T11:52:28.695412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
8649
 
9.3%
- 5180
 
5.6%
e 4842
 
5.2%
o 4638
 
5.0%
a 4636
 
5.0%
n 3856
 
4.2%
l 3696
 
4.0%
i 3486
 
3.8%
r 3306
 
3.6%
C 3034
 
3.3%
Other values (68) 47305
51.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 46427
50.1%
Uppercase Letter 17900
 
19.3%
Decimal Number 13808
 
14.9%
Space Separator 8650
 
9.3%
Dash Punctuation 5180
 
5.6%
Other Punctuation 277
 
0.3%
Open Punctuation 190
 
0.2%
Close Punctuation 189
 
0.2%
Math Symbol 3
 
< 0.1%
Control 2
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 4842
10.4%
o 4638
10.0%
a 4636
10.0%
n 3856
 
8.3%
l 3696
 
8.0%
i 3486
 
7.5%
r 3306
 
7.1%
s 2917
 
6.3%
t 2357
 
5.1%
u 2217
 
4.8%
Other values (18) 10476
22.6%
Uppercase Letter
ValueCountFrequency (%)
C 3034
16.9%
D 2819
15.7%
A 1901
10.6%
B 1728
9.7%
H 1016
 
5.7%
L 883
 
4.9%
F 796
 
4.4%
S 790
 
4.4%
I 642
 
3.6%
T 620
 
3.5%
Other values (16) 3671
20.5%
Decimal Number
ValueCountFrequency (%)
2 2167
15.7%
0 2103
15.2%
1 2017
14.6%
3 1706
12.4%
4 1704
12.3%
7 1494
10.8%
6 875
6.3%
5 713
 
5.2%
8 664
 
4.8%
9 365
 
2.6%
Other Punctuation
ValueCountFrequency (%)
/ 185
66.8%
. 76
27.4%
? 13
 
4.7%
, 2
 
0.7%
& 1
 
0.4%
Space Separator
ValueCountFrequency (%)
8649
> 99.9%
  1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 5180
100.0%
Open Punctuation
ValueCountFrequency (%)
( 190
100.0%
Close Punctuation
ValueCountFrequency (%)
) 189
100.0%
Math Symbol
ValueCountFrequency (%)
+ 3
100.0%
Control
ValueCountFrequency (%)
2
100.0%
Initial Punctuation
ValueCountFrequency (%)
‘ 1
100.0%
Final Punctuation
ValueCountFrequency (%)
’ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 64327
69.4%
Common 28301
30.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 4842
 
7.5%
o 4638
 
7.2%
a 4636
 
7.2%
n 3856
 
6.0%
l 3696
 
5.7%
i 3486
 
5.4%
r 3306
 
5.1%
C 3034
 
4.7%
s 2917
 
4.5%
D 2819
 
4.4%
Other values (44) 27097
42.1%
Common
ValueCountFrequency (%)
8649
30.6%
- 5180
18.3%
2 2167
 
7.7%
0 2103
 
7.4%
1 2017
 
7.1%
3 1706
 
6.0%
4 1704
 
6.0%
7 1494
 
5.3%
6 875
 
3.1%
5 713
 
2.5%
Other values (14) 1693
 
6.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 92609
> 99.9%
None 17
 
< 0.1%
Punctuation 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8649
 
9.3%
- 5180
 
5.6%
e 4842
 
5.2%
o 4638
 
5.0%
a 4636
 
5.0%
n 3856
 
4.2%
l 3696
 
4.0%
i 3486
 
3.8%
r 3306
 
3.6%
C 3034
 
3.3%
Other values (63) 47286
51.1%
None
ValueCountFrequency (%)
é 12
70.6%
è 4
 
23.5%
  1
 
5.9%
Punctuation
ValueCountFrequency (%)
‘ 1
50.0%
’ 1
50.0%
Distinct4701
Distinct (%)93.9%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:29.993952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length6
Mean length6.1956869
Min length1

Characters and Unicode

Total characters31028
Distinct characters49
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4665 ?
Unique (%)93.2%

Sample

1st row?
2nd rowSC1
3rd row?
4th row?
5th row?
ValueCountFrequency (%)
311
 
6.1%
hk 4
 
0.1%
49 3
 
0.1%
f-aeej 2
 
< 0.1%
32 2
 
< 0.1%
82 2
 
< 0.1%
53 2
 
< 0.1%
cf-tcl 2
 
< 0.1%
12406 2
 
< 0.1%
f-bbdm 2
 
< 0.1%
Other values (4732) 4772
93.5%
2023-12-06T11:52:31.929749image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 3497
 
11.3%
C 2022
 
6.5%
A 1711
 
5.5%
1 1541
 
5.0%
N 1432
 
4.6%
2 1246
 
4.0%
P 1193
 
3.8%
4 1187
 
3.8%
5 1132
 
3.6%
0 1098
 
3.5%
Other values (39) 14969
48.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 15946
51.4%
Decimal Number 11081
35.7%
Dash Punctuation 3497
 
11.3%
Other Punctuation 391
 
1.3%
Space Separator 90
 
0.3%
Control 12
 
< 0.1%
Lowercase Letter 10
 
< 0.1%
Math Symbol 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 2022
 
12.7%
A 1711
 
10.7%
N 1432
 
9.0%
P 1193
 
7.5%
B 718
 
4.5%
F 690
 
4.3%
H 636
 
4.0%
T 611
 
3.8%
E 560
 
3.5%
G 559
 
3.5%
Other values (16) 5814
36.5%
Decimal Number
ValueCountFrequency (%)
1 1541
13.9%
2 1246
11.2%
4 1187
10.7%
5 1132
10.2%
0 1098
9.9%
3 1037
9.4%
6 1026
9.3%
7 1015
9.2%
8 912
8.2%
9 887
8.0%
Lowercase Letter
ValueCountFrequency (%)
l 5
50.0%
y 1
 
10.0%
e 1
 
10.0%
o 1
 
10.0%
w 1
 
10.0%
d 1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
? 277
70.8%
/ 114
29.2%
Control
ValueCountFrequency (%)
10
83.3%
2
 
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 3497
100.0%
Space Separator
ValueCountFrequency (%)
90
100.0%
Math Symbol
ValueCountFrequency (%)
+ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 15956
51.4%
Common 15072
48.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 2022
 
12.7%
A 1711
 
10.7%
N 1432
 
9.0%
P 1193
 
7.5%
B 718
 
4.5%
F 690
 
4.3%
H 636
 
4.0%
T 611
 
3.8%
E 560
 
3.5%
G 559
 
3.5%
Other values (22) 5824
36.5%
Common
ValueCountFrequency (%)
- 3497
23.2%
1 1541
10.2%
2 1246
 
8.3%
4 1187
 
7.9%
5 1132
 
7.5%
0 1098
 
7.3%
3 1037
 
6.9%
6 1026
 
6.8%
7 1015
 
6.7%
8 912
 
6.1%
Other values (7) 1381
 
9.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31028
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 3497
 
11.3%
C 2022
 
6.5%
A 1711
 
5.5%
1 1541
 
5.0%
N 1432
 
4.6%
2 1246
 
4.0%
P 1193
 
3.8%
4 1187
 
3.8%
5 1132
 
3.6%
0 1098
 
3.5%
Other values (39) 14969
48.2%

cn_ln
Text

Distinct3908
Distinct (%)78.0%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:33.070044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length19
Mean length4.9239217
Min length1

Characters and Unicode

Total characters24659
Distinct characters44
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3608 ?
Unique (%)72.0%

Sample

1st row1
2nd row?
3rd row?
4th row?
5th row?
ValueCountFrequency (%)
724
 
14.1%
1 10
 
0.2%
4 9
 
0.2%
125 7
 
0.1%
3 7
 
0.1%
30 7
 
0.1%
229 6
 
0.1%
2 5
 
0.1%
18 5
 
0.1%
213 5
 
0.1%
Other values (3928) 4359
84.7%
2023-12-06T11:52:34.923823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3485
14.1%
0 3141
12.7%
2 2641
10.7%
4 2366
9.6%
3 2343
9.5%
5 1861
7.5%
6 1593
6.5%
9 1582
6.4%
7 1577
6.4%
8 1537
6.2%
Other values (34) 2533
10.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 22126
89.7%
Other Punctuation 1380
 
5.6%
Uppercase Letter 582
 
2.4%
Dash Punctuation 430
 
1.7%
Space Separator 136
 
0.6%
Control 3
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 125
21.5%
B 65
11.2%
C 62
10.7%
S 55
9.5%
T 45
 
7.7%
H 32
 
5.5%
U 26
 
4.5%
G 20
 
3.4%
N 20
 
3.4%
E 17
 
2.9%
Other values (14) 115
19.8%
Decimal Number
ValueCountFrequency (%)
1 3485
15.8%
0 3141
14.2%
2 2641
11.9%
4 2366
10.7%
3 2343
10.6%
5 1861
8.4%
6 1593
7.2%
9 1582
7.1%
7 1577
7.1%
8 1537
6.9%
Other Punctuation
ValueCountFrequency (%)
/ 699
50.7%
? 679
49.2%
: 1
 
0.1%
. 1
 
0.1%
Control
ValueCountFrequency (%)
2
66.7%
1
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 430
100.0%
Space Separator
ValueCountFrequency (%)
136
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 24077
97.6%
Latin 582
 
2.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 125
21.5%
B 65
11.2%
C 62
10.7%
S 55
9.5%
T 45
 
7.7%
H 32
 
5.5%
U 26
 
4.5%
G 20
 
3.4%
N 20
 
3.4%
E 17
 
2.9%
Other values (14) 115
19.8%
Common
ValueCountFrequency (%)
1 3485
14.5%
0 3141
13.0%
2 2641
11.0%
4 2366
9.8%
3 2343
9.7%
5 1861
7.7%
6 1593
6.6%
9 1582
6.6%
7 1577
6.5%
8 1537
6.4%
Other values (10) 1951
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 24659
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3485
14.1%
0 3141
12.7%
2 2641
10.7%
4 2366
9.6%
3 2343
9.5%
5 1861
7.5%
6 1593
6.5%
9 1582
6.4%
7 1577
6.4%
8 1537
6.2%
Other values (34) 2533
10.3%
Distinct245
Distinct (%)4.9%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:35.948194image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length1.7360224
Min length1

Characters and Unicode

Total characters8694
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)1.4%

Sample

1st row2
2nd row1
3rd row5
4th row1
5th row20
ValueCountFrequency (%)
3 280
 
5.6%
2 246
 
4.9%
4 202
 
4.0%
5 190
 
3.8%
10 179
 
3.6%
6 174
 
3.5%
7 164
 
3.3%
1 139
 
2.8%
9 130
 
2.6%
11 128
 
2.6%
Other values (235) 3176
63.4%
2023-12-06T11:52:37.740086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 2009
23.1%
2 1411
16.2%
3 1042
12.0%
4 832
9.6%
5 694
 
8.0%
6 607
 
7.0%
7 570
 
6.6%
0 540
 
6.2%
8 504
 
5.8%
9 468
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8677
99.8%
Other Punctuation 17
 
0.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 2009
23.2%
2 1411
16.3%
3 1042
12.0%
4 832
9.6%
5 694
 
8.0%
6 607
 
7.0%
7 570
 
6.6%
0 540
 
6.2%
8 504
 
5.8%
9 468
 
5.4%
Other Punctuation
ValueCountFrequency (%)
? 17
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8694
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 2009
23.1%
2 1411
16.2%
3 1042
12.0%
4 832
9.6%
5 694
 
8.0%
6 607
 
7.0%
7 570
 
6.6%
0 540
 
6.2%
8 504
 
5.8%
9 468
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8694
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 2009
23.1%
2 1411
16.2%
3 1042
12.0%
4 832
9.6%
5 694
 
8.0%
6 607
 
7.0%
7 570
 
6.6%
0 540
 
6.2%
8 504
 
5.8%
9 468
 
5.4%
Distinct235
Distinct (%)4.7%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:38.712485image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length1.5988419
Min length1

Characters and Unicode

Total characters8007
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique68 ?
Unique (%)1.4%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row?
ValueCountFrequency (%)
0 869
 
17.4%
221
 
4.4%
4 170
 
3.4%
2 162
 
3.2%
5 140
 
2.8%
7 130
 
2.6%
3 130
 
2.6%
10 128
 
2.6%
9 128
 
2.6%
8 126
 
2.5%
Other values (225) 2804
56.0%
2023-12-06T11:52:40.471844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1687
21.1%
0 1282
16.0%
2 1026
12.8%
3 730
9.1%
4 692
8.6%
5 599
 
7.5%
7 473
 
5.9%
6 465
 
5.8%
9 416
 
5.2%
8 416
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7786
97.2%
Other Punctuation 221
 
2.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1687
21.7%
0 1282
16.5%
2 1026
13.2%
3 730
9.4%
4 692
8.9%
5 599
 
7.7%
7 473
 
6.1%
6 465
 
6.0%
9 416
 
5.3%
8 416
 
5.3%
Other Punctuation
ValueCountFrequency (%)
? 221
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8007
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1687
21.1%
0 1282
16.0%
2 1026
12.8%
3 730
9.1%
4 692
8.6%
5 599
 
7.5%
7 473
 
5.9%
6 465
 
5.8%
9 416
 
5.2%
8 416
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8007
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1687
21.1%
0 1282
16.0%
2 1026
12.8%
3 730
9.1%
4 692
8.6%
5 599
 
7.5%
7 473
 
5.9%
6 465
 
5.8%
9 416
 
5.2%
8 416
 
5.2%

crew_aboard
Categorical

HIGH CORRELATION 

Distinct35
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
3
954 
2
828 
4
694 
1
535 
5
514 
Other values (30)
1483 

Length

Max length2
Median length1
Mean length1.0698882
Min length1

Characters and Unicode

Total characters5358
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)0.2%

Sample

1st row1
2nd row1
3rd row5
4th row1
5th row?

Common Values

ValueCountFrequency (%)
3 954
19.0%
2 828
16.5%
4 694
13.9%
1 535
10.7%
5 514
10.3%
6 375
 
7.5%
7 244
 
4.9%
? 219
 
4.4%
8 173
 
3.5%
9 115
 
2.3%
Other values (25) 357
 
7.1%

Length

2023-12-06T11:52:41.193402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
3 954
19.0%
2 828
16.5%
4 694
13.9%
1 535
10.7%
5 514
10.3%
6 375
 
7.5%
7 244
 
4.9%
219
 
4.4%
8 173
 
3.5%
9 115
 
2.3%
Other values (25) 357
 
7.1%

Most occurring characters

ValueCountFrequency (%)
3 993
18.5%
1 920
17.2%
2 902
16.8%
4 727
13.6%
5 538
10.0%
6 389
 
7.3%
7 252
 
4.7%
? 219
 
4.1%
8 181
 
3.4%
9 126
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5139
95.9%
Other Punctuation 219
 
4.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 993
19.3%
1 920
17.9%
2 902
17.6%
4 727
14.1%
5 538
10.5%
6 389
 
7.6%
7 252
 
4.9%
8 181
 
3.5%
9 126
 
2.5%
0 111
 
2.2%
Other Punctuation
ValueCountFrequency (%)
? 219
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5358
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 993
18.5%
1 920
17.2%
2 902
16.8%
4 727
13.6%
5 538
10.0%
6 389
 
7.3%
7 252
 
4.7%
? 219
 
4.1%
8 181
 
3.4%
9 126
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5358
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 993
18.5%
1 920
17.2%
2 902
16.8%
4 727
13.6%
5 538
10.0%
6 389
 
7.3%
7 252
 
4.7%
? 219
 
4.1%
8 181
 
3.4%
9 126
 
2.4%
Distinct200
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:42.271736image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length2
Mean length1.5842652
Min length1

Characters and Unicode

Total characters7934
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique51 ?
Unique (%)1.0%

Sample

1st row1
2nd row1
3rd row5
4th row1
5th row14
ValueCountFrequency (%)
1 384
 
7.7%
2 377
 
7.5%
3 363
 
7.2%
4 242
 
4.8%
5 235
 
4.7%
6 176
 
3.5%
7 160
 
3.2%
10 159
 
3.2%
13 132
 
2.6%
9 128
 
2.6%
Other values (190) 2652
53.0%
2023-12-06T11:52:43.901723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1943
24.5%
2 1370
17.3%
3 990
12.5%
4 743
 
9.4%
5 618
 
7.8%
0 511
 
6.4%
6 486
 
6.1%
7 484
 
6.1%
8 421
 
5.3%
9 360
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7926
99.9%
Other Punctuation 8
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1943
24.5%
2 1370
17.3%
3 990
12.5%
4 743
 
9.4%
5 618
 
7.8%
0 511
 
6.4%
6 486
 
6.1%
7 484
 
6.1%
8 421
 
5.3%
9 360
 
4.5%
Other Punctuation
ValueCountFrequency (%)
? 8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7934
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1943
24.5%
2 1370
17.3%
3 990
12.5%
4 743
 
9.4%
5 618
 
7.8%
0 511
 
6.4%
6 486
 
6.1%
7 484
 
6.1%
8 421
 
5.3%
9 360
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7934
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1943
24.5%
2 1370
17.3%
3 990
12.5%
4 743
 
9.4%
5 618
 
7.8%
0 511
 
6.4%
6 486
 
6.1%
7 484
 
6.1%
8 421
 
5.3%
9 360
 
4.5%
Distinct191
Distinct (%)3.8%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:44.904104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length1
Mean length1.4620607
Min length1

Characters and Unicode

Total characters7322
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)1.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row?
ValueCountFrequency (%)
0 1040
20.8%
1 308
 
6.2%
2 263
 
5.3%
235
 
4.7%
3 193
 
3.9%
4 185
 
3.7%
5 139
 
2.8%
6 133
 
2.7%
7 126
 
2.5%
8 126
 
2.5%
Other values (181) 2260
45.1%
2023-12-06T11:52:46.766956image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 1582
21.6%
0 1367
18.7%
2 974
13.3%
3 669
9.1%
4 569
 
7.8%
5 488
 
6.7%
6 405
 
5.5%
7 388
 
5.3%
8 334
 
4.6%
9 311
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 7087
96.8%
Other Punctuation 235
 
3.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1582
22.3%
0 1367
19.3%
2 974
13.7%
3 669
9.4%
4 569
 
8.0%
5 488
 
6.9%
6 405
 
5.7%
7 388
 
5.5%
8 334
 
4.7%
9 311
 
4.4%
Other Punctuation
ValueCountFrequency (%)
? 235
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 7322
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1582
21.6%
0 1367
18.7%
2 974
13.3%
3 669
9.1%
4 569
 
7.8%
5 488
 
6.7%
6 405
 
5.5%
7 388
 
5.3%
8 334
 
4.6%
9 311
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7322
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1582
21.6%
0 1367
18.7%
2 974
13.3%
3 669
9.1%
4 569
 
7.8%
5 488
 
6.7%
6 405
 
5.5%
7 388
 
5.3%
8 334
 
4.6%
9 311
 
4.2%

crew_fatalities
Categorical

HIGH CORRELATION 

Distinct29
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2
892 
3
824 
1
771 
4
591 
5
402 
Other values (24)
1528 

Length

Max length2
Median length1
Mean length1.0463259
Min length1

Characters and Unicode

Total characters5240
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.1%

Sample

1st row0
2nd row0
3rd row5
4th row1
5th row?

Common Values

ValueCountFrequency (%)
2 892
17.8%
3 824
16.5%
1 771
15.4%
4 591
11.8%
5 402
8.0%
0 400
8.0%
6 273
 
5.5%
? 235
 
4.7%
7 171
 
3.4%
8 130
 
2.6%
Other values (19) 319
 
6.4%

Length

2023-12-06T11:52:47.506498image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2 892
17.8%
3 824
16.5%
1 771
15.4%
4 591
11.8%
5 402
8.0%
0 400
8.0%
6 273
 
5.5%
235
 
4.7%
7 171
 
3.4%
8 130
 
2.6%
Other values (19) 319
 
6.4%

Most occurring characters

ValueCountFrequency (%)
1 1026
19.6%
2 940
17.9%
3 857
16.4%
4 615
11.7%
0 471
9.0%
5 415
7.9%
6 278
 
5.3%
? 235
 
4.5%
7 178
 
3.4%
8 133
 
2.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5005
95.5%
Other Punctuation 235
 
4.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 1026
20.5%
2 940
18.8%
3 857
17.1%
4 615
12.3%
0 471
9.4%
5 415
8.3%
6 278
 
5.6%
7 178
 
3.6%
8 133
 
2.7%
9 92
 
1.8%
Other Punctuation
ValueCountFrequency (%)
? 235
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5240
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 1026
19.6%
2 940
17.9%
3 857
16.4%
4 615
11.7%
0 471
9.0%
5 415
7.9%
6 278
 
5.3%
? 235
 
4.5%
7 178
 
3.4%
8 133
 
2.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5240
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 1026
19.6%
2 940
17.9%
3 857
16.4%
4 615
11.7%
0 471
9.0%
5 415
7.9%
6 278
 
5.3%
? 235
 
4.5%
7 178
 
3.4%
8 133
 
2.5%

ground
Text

Distinct52
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:48.041163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length1
Mean length1.0167732
Min length1

Characters and Unicode

Total characters5092
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)0.5%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
ValueCountFrequency (%)
0 4716
94.2%
1 63
 
1.3%
44
 
0.9%
2 34
 
0.7%
3 21
 
0.4%
4 16
 
0.3%
5 12
 
0.2%
7 10
 
0.2%
8 9
 
0.2%
10 6
 
0.1%
Other values (42) 77
 
1.5%
2023-12-06T11:52:49.495268image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4731
92.9%
1 104
 
2.0%
2 62
 
1.2%
? 44
 
0.9%
3 40
 
0.8%
4 34
 
0.7%
5 28
 
0.5%
7 18
 
0.4%
8 14
 
0.3%
6 9
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5048
99.1%
Other Punctuation 44
 
0.9%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4731
93.7%
1 104
 
2.1%
2 62
 
1.2%
3 40
 
0.8%
4 34
 
0.7%
5 28
 
0.6%
7 18
 
0.4%
8 14
 
0.3%
6 9
 
0.2%
9 8
 
0.2%
Other Punctuation
ValueCountFrequency (%)
? 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5092
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 4731
92.9%
1 104
 
2.0%
2 62
 
1.2%
? 44
 
0.9%
3 40
 
0.8%
4 34
 
0.7%
5 28
 
0.5%
7 18
 
0.4%
8 14
 
0.3%
6 9
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5092
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4731
92.9%
1 104
 
2.0%
2 62
 
1.2%
? 44
 
0.9%
3 40
 
0.8%
4 34
 
0.7%
5 28
 
0.5%
7 18
 
0.4%
8 14
 
0.3%
6 9
 
0.2%
Distinct4858
Distinct (%)97.0%
Missing0
Missing (%)0.0%
Memory size39.3 KiB
2023-12-06T11:52:50.513368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length2669
Median length791
Mean length220.77376
Min length1

Characters and Unicode

Total characters1105635
Distinct characters102
Distinct categories14 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4813 ?
Unique (%)96.1%

Sample

1st rowDuring a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later.
2nd rowEugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show.
3rd rowFirst U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight.
4th rowThe first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed.
5th rowThe airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants.
ValueCountFrequency (%)
the 18463
 
10.1%
of 5544
 
3.0%
a 5456
 
3.0%
and 5444
 
3.0%
to 5429
 
3.0%
in 3682
 
2.0%
crashed 3386
 
1.8%
was 2779
 
1.5%
aircraft 2557
 
1.4%
into 2360
 
1.3%
Other values (11568) 128035
69.9%
2023-12-06T11:52:52.445030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
179362
16.2%
e 104905
 
9.5%
t 81905
 
7.4%
a 79924
 
7.2%
n 68116
 
6.2%
i 65870
 
6.0%
r 63437
 
5.7%
o 62600
 
5.7%
h 42794
 
3.9%
s 39810
 
3.6%
Other values (92) 316912
28.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 869373
78.6%
Space Separator 179369
 
16.2%
Uppercase Letter 25294
 
2.3%
Other Punctuation 20683
 
1.9%
Decimal Number 8853
 
0.8%
Dash Punctuation 1645
 
0.1%
Close Punctuation 158
 
< 0.1%
Open Punctuation 140
 
< 0.1%
Final Punctuation 67
 
< 0.1%
Control 33
 
< 0.1%
Other values (4) 20
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 104905
12.1%
t 81905
 
9.4%
a 79924
 
9.2%
n 68116
 
7.8%
i 65870
 
7.6%
r 63437
 
7.3%
o 62600
 
7.2%
h 42794
 
4.9%
s 39810
 
4.6%
d 38411
 
4.4%
Other values (30) 221601
25.5%
Uppercase Letter
ValueCountFrequency (%)
T 5796
22.9%
C 2775
11.0%
A 2579
10.2%
S 1531
 
6.1%
F 1286
 
5.1%
M 1207
 
4.8%
I 1063
 
4.2%
P 960
 
3.8%
W 924
 
3.7%
N 861
 
3.4%
Other values (16) 6312
25.0%
Other Punctuation
ValueCountFrequency (%)
. 13487
65.2%
, 5721
27.7%
' 771
 
3.7%
" 362
 
1.8%
/ 170
 
0.8%
? 59
 
0.3%
: 56
 
0.3%
; 34
 
0.2%
& 17
 
0.1%
% 3
 
< 0.1%
Other values (2) 3
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 2668
30.1%
1 1368
15.5%
2 1042
 
11.8%
5 830
 
9.4%
3 820
 
9.3%
4 578
 
6.5%
6 432
 
4.9%
7 416
 
4.7%
8 386
 
4.4%
9 313
 
3.5%
Space Separator
ValueCountFrequency (%)
179362
> 99.9%
  7
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 157
99.4%
] 1
 
0.6%
Open Punctuation
ValueCountFrequency (%)
( 139
99.3%
[ 1
 
0.7%
Control
ValueCountFrequency (%)
32
97.0%
1
 
3.0%
Dash Punctuation
ValueCountFrequency (%)
- 1645
100.0%
Final Punctuation
ValueCountFrequency (%)
’ 67
100.0%
Math Symbol
ValueCountFrequency (%)
+ 7
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 7
100.0%
Other Symbol
ValueCountFrequency (%)
° 3
100.0%
Initial Punctuation
ValueCountFrequency (%)
‘ 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 894667
80.9%
Common 210968
 
19.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 104905
11.7%
t 81905
 
9.2%
a 79924
 
8.9%
n 68116
 
7.6%
i 65870
 
7.4%
r 63437
 
7.1%
o 62600
 
7.0%
h 42794
 
4.8%
s 39810
 
4.4%
d 38411
 
4.3%
Other values (56) 246895
27.6%
Common
ValueCountFrequency (%)
179362
85.0%
. 13487
 
6.4%
, 5721
 
2.7%
0 2668
 
1.3%
- 1645
 
0.8%
1 1368
 
0.6%
2 1042
 
0.5%
5 830
 
0.4%
3 820
 
0.4%
' 771
 
0.4%
Other values (26) 3254
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1105493
> 99.9%
None 72
 
< 0.1%
Punctuation 70
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
179362
16.2%
e 104905
 
9.5%
t 81905
 
7.4%
a 79924
 
7.2%
n 68116
 
6.2%
i 65870
 
6.0%
r 63437
 
5.7%
o 62600
 
5.7%
h 42794
 
3.9%
s 39810
 
3.6%
Other values (74) 316770
28.7%
Punctuation
ValueCountFrequency (%)
’ 67
95.7%
‘ 3
 
4.3%
None
ValueCountFrequency (%)
é 20
27.8%
á 15
20.8%
í 8
 
11.1%
  7
 
9.7%
ó 3
 
4.2%
° 3
 
4.2%
ö 3
 
4.2%
ð 2
 
2.8%
ü 2
 
2.8%
ã 2
 
2.8%
Other values (6) 7
 
9.7%

Correlations

2023-12-06T11:52:52.930044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
crew_aboardcrew_fatalities
crew_aboard1.0000.755
crew_fatalities0.7551.000

Missing values

2023-12-06T11:52:05.071483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-06T11:52:06.596905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

fechaHORA declaradaRutaOperadORflight_norouteac_typeregistrationcn_lnall_aboardPASAJEROS A BORDOcrew_aboardcantidad de fallecidospassenger_fatalitiescrew_fatalitiesgroundsummary
0September 17, 19081718Fort Myer, VirginiaMilitary - U.S. Army?DemonstrationWright Flyer III?12111100During a demonstration flight, a U.S. Army flyer flown by Orville Wright nose-dived into the ground from a height of approximately 75 feet, killing Lt. Thomas E. Selfridge, 26, who was a passenger. This was the first recorded airplane fatality in history. One of two propellers separated in flight, tearing loose the wires bracing the rudder and causing the loss of control of the aircraft. Orville Wright suffered broken ribs, pelvis and a leg. Selfridge suffered a crushed skull and died a short time later.
1September 07, 1909?Juvisy-sur-Orge, France??Air showWright ByplaneSC1?1011000Eugene Lefebvre was the first pilot to ever be killed in an air accident, after his controls jambed while flying in an air show.
2July 12, 19120630Atlantic City, New JerseyMilitary - U.S. Navy?Test flightDirigible??5055050First U.S. dirigible Akron exploded just offshore at an altitude of 1,000 ft. during a test flight.
3August 06, 1913?Victoria, British Columbia, CanadaPrivate??Curtiss seaplane??1011010The first fatal airplane accident in Canada occurred when American barnstormer, John M. Bryant, California aviator was killed.
4September 09, 19131830Over the North SeaMilitary - German Navy??Zeppelin L-1 (airship)??20??14??0The airship flew into a thunderstorm and encountered a severe downdraft crashing 20 miles north of Helgoland Island into the sea. The ship broke in two and the control car immediately sank drowning its occupants.
5October 17, 19131030Near Johannisthal, GermanyMilitary - German Navy??Zeppelin L-2 (airship)??28??28??0Hydrogen gas which was being vented was sucked into the forward engine and ignited causing the airship to explode and burn at 3,000 ft..German Navy's Zeppelin airships L-4 and L-5 were blown out to sea in February 1915, never to be seen again.
6March 05, 19150100Tienen, BelgiumMilitary - German Navy??Zeppelin L-8 (airship)??41041170170Crashed into trees while attempting to land after being shot down by British and French aircraft.
7September 03, 19151520Off Cuxhaven, GermanyMilitary - German Navy??Zeppelin L-10 (airship)??19??19??0Exploded and burned near Neuwerk Island, when hydrogen gas, being vented, was ignited by lightning.
8July 28, 1916?Near Jambol, BulgeriaMilitary - German Army??Schutte-Lanz S-L-10 (airship)??20??20??0Crashed near the Black Sea, cause unknown.
9September 24, 19160100Billericay, EnglandMilitary - German Navy??Zeppelin L-32 (airship)??22??22??0Shot down by British aircraft crashing in flames.
fechaHORA declaradaRutaOperadORflight_norouteac_typeregistrationcn_lnall_aboardPASAJEROS A BORDOcrew_aboardcantidad de fallecidospassenger_fatalitiescrew_fatalitiesgroundsummary
4998August 07, 20201914Calicut, IndiaAir India ExppressIX344Dubai - CalicutBoeing 737-8HGVT-AXH36323/21081901846201820The flight IX344 suffered a runway excursion while landing at Kozhikode-Calicut Airport in heavy rain. The nose section separated from the fuselage after going down a steep slope at the end of the runway. The pilot and copilot were among the dead. Low visibility, wet runway, low cloud base and poor braking action possibly contributed to the accident.
4999August 22, 20200840Juba, South SudanSouth West Aviaiton?Juba - WauAntonov 26BEX-126115088537430The cargo plane lost height shortly after departure from Juba Airport and impacted a farm near Hai Referendum about 3nm southwest of the airport. One passenger survived in critical condition. The plane was chartered by the World Food Program to transport supplies and wages to Wau and Aweil.
5000September 25, 20202050Near Chuguev, UkraineMilitary - Ukraine Air Force?TrainingAntonov An26SH76 yellow560827207261970The military transport, crashed 1.2 miles from Chuguev air base. The plane was carrying cadets from a nearby air force university on a training flight. The crew may have reported failure of an engine prior to the accident.
5001January 09, 20211440Near Jakarta, IndonesiaSriwijaya AirSJ182Jakarta - PontianakBoeing 737-524PK-CLC27323/261662566625660Sriwijaya Air flight 182 was climbing through 10,900 ft., 11 nm north of Jakarta-Soekarno-Hatta International Airport, over the Java Sea when radar and radio contact was lost. The aircraft then lost height rapidly and impacted the Java Sea. Debris was located near Lancang Island.
5002March 02, 20211705Pieri, SudanSouth Sudan Supreme Airlines?Pieri - YuaiLet L-410UVP-EHK-4274902525108210820One of the engines on the aircraft failed 10 minutes after takeof. When the plane turned back, the second engine failed.
5003March 28, 20211835Near Butte, AlaskaSoloy Helicopters?Sightseeing CharterEurocopter AS350B3 EcureuilN351SH45986515410The sightseeing helicopter crashed after missing the top of a 6,000 ft mountain by just 10 - 15 ft. The crash site was near Knik glacier. The pilot, and four others were killed including Czech billionaire Petr Kellner.
5004May 21, 20211800Near Kaduna, NigeriaMilitary - Nigerian Air Force??Beechcraft B300 King Air 350iNAF203FL-891117411740While on final approach, in poor weather conditions, the aircraft crashed and burst into flames less than 10 km from Kaduna Airport. All 11 occupants were killed, incuding General Ibrahim Attahiru, Chief of Staff of the Nigerian Army.
5005June 10, 20210800Near Pyin Oo Lwin, MyanmarMilitary - Myanmar Air Force?Naypyidaw - AnisakanBeechcraft 1900D4610E-32514122121110The plane was carrying military personnel and monks when it crashed about 300 meters from a steel plant in the Mandalay region. The plane was attempting to land in poor weather conditions and broke into three pieces.
5006July 04, 202111:30Patikul, Sulu, PhilippinesMilitary - Philippine Air Force?Cagayan de Oro-Lumbia - JoloLockheed C-130H Hercules512551259688850??3While attempting to land at Jolo Airport, the military transport overran the runway, struck two houses and burst into flames coming to rest on a coconut plantation.
5007July 06, 20211500Palana, RussiaKamchatka Aviation Enterprise251Petropavlovsk - PalanaAntonov An 26B-100RA-260851231028226282260The passenger plane crashed into the top of a cliff while attempting to land in inclement weather. The debris fell into the sea. Contact was lost with the plane 10 minutes before it was to land.